home
***
CD-ROM
|
disk
|
FTP
|
other
***
search
/
The American Multimedia Archive 3
/
The American Multimedia Archive - Volume 3.iso
/
lrsystem
/
morph.dat
< prev
next >
Wrap
Text File
|
1996-11-08
|
18KB
|
592 lines
; morph.dat v2.0e-002
; Copyright 1996 Logos Research Systems. All rights reserved.
;
; This is a morph definition file. Do not modify unless you understand the format!
; Format:
; Each morph definition (mdef): <ID>=<CHAIN>=<TAG>=<TEXT>\n
; where <ID> = pos. integer, so we can refer to the category later in this file;
; <CHAIN> = max. length of dependences in skip chains (see below);
; <TAG> = the morph tag string used by Westminster or GramCord;
; <TEXT> = the text to appear in the panel's lists.
; Note: the ID numbers are defined by this file and do not correspond to anything else.
; All spaces are relevant, not meaningless white space; keep spaces out of the ID, CHAIN, and TAG.
; Spaces around TEXT will be retained in the string.
; Mdefs are categorized using tab-indents; do not use spaces! The tab count must be exact.
; Level 0 (0 tabs)= language.
; Level 1 (1 tab)= "part of speech", or what might be called "morph trunk" (mtrunk).
; Level 2 (2 tabs)= morph categories (mcats) available under the morph trunk.
; Level 3 (3 tabs)= morph codes (mcodes) available under the morph category.
; Level 4 (4 tabs)= is SPECIAL: a list of comma delimited IDs, for the mcats/mcodes NOT used for this mcode (= "mskips").
; Semicolon (;) at the beginning of a line (only) marks a comment line.
;
; Note: Mcat ID number must indicate the correct order of categories in the morph tagging scheme. That is,
; they ensure that mcodes are sorted into the right order to make the final morph tag. Don't change them without
; being quite sure this sort order is unaffected!
; Largest ID allowed is 32000. 0 (zero) is a reserved ID.
;
; Hebrew: Note that "Any" cannot be used in a number of categories, because the '?' MUST
; match something. If it is used, it will miss tags that end before that slot, or have other codes filling
; that slot for certain tenses, etc., etc.
;
; CHAIN: Level 1 (mtrunk) requires a chain length number (after the ID), which tells how many times to iterate
; when updating the panel based on chains of dependencies among the level 2 mcats. This number enables
; one to have nested skips -- skips whose fields also have skips, etc. The chain length number
; says what is the longest chain of skip dependency there is among the mcats of an mtrunk (part of speech).
; Without this feature, the system operates as though chain length = 1. This means there is only one iteration
; of updating based on mskips. This entails that in certain situations of nested skips, some combinations of
; mcodes/mcats will not behave as desired. The following warnings and problems apply to this situation
; (i.e., ONLY if you omit CHAIN or set CHAIN=1):
;
; WARNING: DO NOT use level 4 mskip numbers referring to categories that also have level 4 skip numbers.
; That is, no nesting of skipping -- UNLESS all the subsequent/lower down skips are also explicitly skipped
; at the same time. That is, you can't skip an intermediary. The reason for this is that the intermediary
; category that is skipped may have needed to make other mcats/mcodes skipped too -- but these mcats/mcodes will
; remain enabled and visible, because the intermediary's skip-list is not used. E.g.: If I set Hebrew State such
; that Absolute skips Suffix (mcat), then mcodes under Suffix will not be used to skip subsequent mcats pertinent
; only to suffixes. That is, you may have pronominal suffix person, gender, and number fields enabled and visible,
; though you had selected Absolute state and thereby skipped the Suffix field (thus invisible). The reason behind
; this is that Suffix mcat in reality has certain properties SUBORDINATE to it, which really should not be of concern
; in the skip-lists of other mcats such as State. This subordination, however, is not represented here. The same
; problem will apply to individual mcodes that have subordinate mcats/mcodes to them.
;
; However, there is a particular situation where mcats/mcodes can skip MCODES that in turn have mskips.
; If mcode A skips mcode X from another field/mcat, and mcode X has various mskips, BUT there is also an mcode Y
; in X's field/mcat with the same mskips AND A does NOT skip Y, then the fact that X's mskips will still hang
; around when X is skipped is OK, because those mcats/mcodes need to be available for Y. E.g.: Hebrew Qal stem
; includes Passive Ptc. (and Participle [Active]), but Niphal skips Passive Ptc. BUT uses Participle. This is OK,
; because with Niphal selected, Passive Ptc. is not available, but its participle categories are available for Participle.
; Therefore, it is possible to skip an MCODE with subordinate mcats/mcodes to skip, IF those subordinate mcats/mcodes
; will still be needed anyway by another MCODE. This scenario will almost surely happen when these two such mcodes
; are in the same field/mcat. This is why we say this exception doesn't apply to mcats -- mcats with mskips should
; never be skipped. However, it is in theory possible for some mcode from another mcat to have the same subordinate
; mcats/mcodes to skip and to always be available for selection when the skipped mcode is invisible.
;
; If an mcode A (e.g., Niphal) skips mcode B (Pass. Ptc.), which in turn skips mcodes from A's mcat (Niphal - Hophal),
; then this is OK: Since A skips B, B won't disable anything from A's mcat. However, if A does not skip, but rather
; enables B, AND B skips A, then this is an error (result not defined).
;
; Order of data below: Hebrew, Greek
;
1000==LS_HEBREWMORPH=Hebrew
; 1010=1=x=Paragraph or Nun (x)
1400=3=v=Verb (v)
1410===Stem/Binyan
1411==?=Any (?)
1412==q=Qal (q)
1413==n=Niphal (n)
; The following skip of Pass. Ptc., which has mskips, is OK, because the its "forgotten" mskips will be used by Participle.
1449
1414==p=Piel,Poel,&c. (p)
1449
1415==P=Pual,Poal,&c. (P)
1449
1416==t=Hithpael,-poel,&c. (t)
1449
1417==h=Hiphil (h)
1449
1418==H=Hophal (H)
1449
1419==Q=Qal Passive (Q)
1449
1440===Tense/Aspect
1441==?=Any Finite (?)
1520
1442==p=Perfect (p)
1510,1520
1443==i=Imperfect (i)
1520
1444==w=Consec. Imperf. (w)
1520
1445==v=Imperative (v)
1480,1510,1520
1446==c=Infin. Constr. (c)
1480,1490,1500,1510,1520
1447==a=Infin. Absol. (a)
1480,1490,1500,1510,1520,1600,1700,1750,1800
1448==P=Participle (P)
1480,1510
1449==s=Pass. Ptc. [Qal] (s)
; Qal only!
1480,1510
1480===Person
1481==?=Any (?)
1482==1=First (1)
1483==2=Second (2)
1484==3=Third (3)
1490===Gender
1491==?=Any (?)
1492==m=Masculine (m)
1493==f=Feminine (f)
1494==c=Common (c)
1500===Number
1501==?=Any (?)
1502==s=Singular (s)
1503==p=Plural (p)
1504==d=Dual (d)
1510===Form
1511===Normal ()
1512==j=Apocopated (j)
1441,1442,1445,1446,1447,1448,1449
1513==h=Cohortative (h)
1441,1442,1445,1446,1447,1448,1449
1520===State
1493==c=Construct (c)
1492===Absolute ()
; Don't skip Suffix here, because State is sts. skipped.
1600,1700,1750,1800
; 1495==d=Determined (d) [Aramaic]
; Not supported yet!
1600===Suffix
1610===None ()
1700,1750,1800
1620==X=Pronominal (X)
1630==Xh=Paragogic He (Xh)
1700,1750,1800
; 1640==Xd=Directional He (Xd)
; 1700,1750,1800
1650==Xn=Paragogic Nun (Xn)
1700,1750,1800
1660==X?=Any Parag./Dir. (X?)
1700,1750,1800
1670==X*=Any (X*)
1700,1750,1800
1700===Sfx. Person
1710==?=Any (?)
1720==1=First (1)
1730==2=Second (2)
1740==3=Third (3)
1750===Sfx. Gender
1760==?=Any (?)
1770==m=Masculine (m)
1780==f=Feminine (f)
1790==c=Common (c)
1800===Sfx. Number
1801==?=Any (?)
1802==s=Singular (s)
1803==p=Plural (p)
1804==d=Dual (d)
1200=2=n=Noun (n)
1202===Class
1208==c=Common (c)
1210==p=Proper/Name (p)
1220,1230
; 1212==g=Gentilic (g) [Aramaic]
; 1220,1230
1220===Gender
1221==?=Any (?)
1222==m=Masculine (m)
1223==f=Feminine (f)
1224==b=Masc./Fem. (b)
1230===Number
1231==?=Any (?)
1232==s=Singular (s)
1233==p=Plural (p)
1234==d=Dual (d)
1240===State
1242===Absolute ()
; One might rather allow suffixes with Absolute, because the Westminster data may mistakenly include this.
1250,1260,1270,1280
1243==c=Construct (c)
; Suffixes should go with Construct forms, in the Westminster data, though there should be a status pronominalis.
; 1250,1260,1270,1280
; 1244===Suffixed ()
; This would be the status pronominalis.
; 1245==d=Determined (d) [Aramaic]
; Not yet supported!
; 1250,1260,1270,1280
1250===Suffix
1251===None ()
1260,1270,1280
1252==X=Pronominal (X)
1253==Xh=Paragogic He (Xh)
1260,1270,1280
1254==Xd=Directional He (Xd)
1260,1270,1280
; 1255==Xn=Paragogic Nun (Xn)
; 1260,1270,1280
1257==X?=Any Parag./Dir. (X?)
1260,1270,1280
1258==X*=Any (X*)
1260,1270,1280
1260===Sfx. Person
1261==?=Any (?)
1262==1=First (1)
1263==2=Second (2)
1264==3=Third (3)
1270===Sfx. Gender
1271==?=Any (?)
1272==m=Masculine (m)
1273==f=Feminine (f)
1274==c=Common (c)
1280===Sfx. Number
1281==?=Any (?)
1282==s=Singular (s)
1283==p=Plural (p)
1284==d=Dual (d)
1300=2=a=Adjective/Number (a)
1320===Gender
1321==?=Any (?)
1322==m=Masculine (m)
1323==f=Feminine (f)
1324==b=Masc./Fem. (b)
1330===Number
1331==?=Any (?)
1332==s=Singular (s)
1333==p=Plural (p)
1334==d=Dual (d)
1340===State
1342===Absolute ()
1350,1360,1370,1380
1343==c=Construct (c)
; 1344==d=Determined (d) [Aramaic]
; Not yet supported!
1350===Suffix
1351===None ()
1360,1370,1380
1352==X=Pronominal (X)
1353==Xh=Paragogic He (Xh)
1360,1370,1380
; 1354==Xd=Directional He (Xd)
; 1360,1370,1380
; 1355==Xn=Paragogic Nun (Xn)
; 1360,1370,1380
1357==X?=Any Parag./Dir. (X?)
1360,1370,1380
1358==X*=Any (X*)
1360,1370,1380
1360===Sfx. Person
1361==?=Any (?)
1362==1=First (1)
1363==2=Second (2)
1364==3=Third (3)
1370===Sfx. Gender
1371==?=Any (?)
1372==m=Masculine (m)
1373==f=Feminine (f)
1374==c=Common (c)
1380===Sfx. Number
1381==?=Any (?)
1382==s=Singular (s)
1383==p=Plural (p)
1384==d=Dual (d)
1900=2=pi=Pronoun (pi)
1901===Class
1903===Independent ()
1940,1950,1960,1970
1904==i=Interrogative (i)
1910,1920,1930
1910===Person
1915==?=Any (?)
1911==1=First (1)
1912==2=Second (2)
1913==3=Third (3)
1920===Gender
1925==?=Any (?)
1921==m=Masculine (m)
1922==f=Feminine (f)
1923==b=Masc./Fem. (b)
1930===Number
1935==?=Any (?)
1931==s=Singular (s)
1932==p=Plural (p)
1933==d=Dual (d)
1940===Suffix
1941===None ()
1950,1960,1970
1942==X=Pronominal (X)
; 1943==Xh=Paragogic He (Xh)
; 1950,1960,1970
1944==Xd=Directional He (Xd)
1950,1960,1970
; 1945==Xn=Paragogic Nun (Xn)
; 1950,1960,1970
1947==X?=Any Parag./Dir. (X?)
1950,1960,1970
1948==X*=Any (X*)
1950,1960,1970
1950===Sfx. Person
1951==?=Any (?)
1952==1=First (1)
1953==2=Second (2)
1954==3=Third (3)
1960===Sfx. Gender
1961==?=Any (?)
1962==m=Masculine (m)
1963==f=Feminine (f)
1964==c=Common (c)
1970===Sfx. Number
1971==?=Any (?)
1972==s=Singular (s)
1973==p=Plural (p)
1974==d=Dual (d)
1100=2=P=Particle (P)
1110===Class
1111==?=Any (?)
1116==d=Adverb (d)
1112==a=Article (a)
1140,1150,1160,1170
1114==c=Conjunction (c)
1140,1150,1160,1170
1124==o=Def. Dir. Obj. (o)
1143,1144
1120==i=Interjection (i)
1140,1150,1160,1170
1118==g=Interrogative (g)
1143,1144
1122==n=Negative (n)
1140,1150,1160,1170
1126==p=Preposition (p)
1143
1128==r=Relative (r)
1140,1150,1160,1170
1140===Suffix
1141===None ()
1150,1160,1170
1142==X=Pronominal (X)
1143==Xh=Paragogic He (Xh)
1150,1160,1170
1144==Xd=Directional He (Xd)
1150,1160,1170
; 1145==Xn=Paragogic Nun (Xn)
; 1150,1160,1170
1147==X?=Any Parag./Dir. (X?)
1150,1160,1170
1148==X*=Any (X*)
1150,1160,1170
1150===Sfx. Person
1151==?=Any (?)
1152==1=First (1)
1153==2=Second (2)
1154==3=Third (3)
1160===Sfx. Gender
1161==?=Any (?)
1162==m=Masculine (m)
1163==f=Feminine (f)
1164==c=Common (c)
1170===Sfx. Number
1171==?=Any (?)
1172==s=Singular (s)
1173==p=Plural (p)
1174==d=Dual (d)
2000==LS_GREEKMORPH=Greek
2010=2=V=Verb (V)
2020===Person
2021==?=Any (?)
2022==1=First (1)
2024==2=Second (2)
2026==3=Third (3)
2030===Number
2031==?=Any (?)
2032==S=Singular (S)
2034==P=Plural (P)
2040===Tense/Aspect
2041==?=Any (?)
2042==P=Present (P)
2044==I=Imperfect (I)
2046==F=Future (F)
2048==A=Aorist (A)
2050==R=Perfect (R)
2052==L=Pluperfect (L)
2055===Voice
2056==?=Any (?)
2057==A=Active (A)
2058==M=Middle (M)
2059==P=Passive (P)
2060===Mood/Mode
2061==?=Any Finite (?)
2080,2090,2100
2062==I=Indicative (I)
2080,2090,2100
2064==S=Subjunctive (S)
2080,2090,2100
2066==O=Optative (O)
2080,2090,2100
2068==M=Imperative (M)
2080,2090,2100
2070==N=Infinitive (N)
2020,2030,2080,2090,2100
2072==P=Participle (P)
2020,2030
2080===Gender
2081==?=Any (?)
2082==M=Masculine (M)
2084==F=Feminine (F)
2086==N=Neuter (N)
2090===Number
2091==?=Any (?)
2092==S=Singular (S)
2093==P=Plural (P)
2100===Case
2101==?=Any (?)
2102==N=Nominative (N)
2104==G=Genitive (G)
2106==D=Dative (D)
2108==A=Accusative (A)
2110==V=Vocative (V)
2200=1=N=Noun (N)
2220===Gender
2221==?=Any (?)
2222==M=Masculine (M)
2224==F=Feminine (F)
2225==N=Neuter (N)
2230===Number
2231==?=Any (?)
2232==S=Singular (S)
2234==P=Plural (P)
2240===Case
2241==?=Any (?)
2242==N=Nominative (N)
2244==G=Genitive (G)
2246==D=Dative (D)
2248==A=Accusative (A)
2249==V=Vocative (V)
2300=1=A=Article (A)
2320===Gender
2321==?=Any (?)
2322==M=Masculine (M)
2324==F=Feminine (F)
2325==N=Neuter (N)
2330===Number
2331==?=Any (?)
2332==S=Singular (S)
2334==P=Plural (P)
2340===Case
2341==?=Any (?)
2342==N=Nominative (N)
2344==G=Genitive (G)
2346==D=Dative (D)
2348==A=Accusative (A)
2349==V=Vocative (V)
2400=1=J=Adjective (J)
2420===Gender
2421==?=Any (?)
2422==M=Masculine (M)
2424==F=Feminine (F)
2425==N=Neuter (N)
2430===Number
2431==?=Any (?)
2432==S=Singular (S)
2434==P=Plural (P)
2440===Case
2441==?=Any (?)
2442==N=Nominative (N)
2444==G=Genitive (G)
2446==D=Dative (D)
2448==A=Accusative (A)
2449==V=Vocative (V)
2450===Degree
2451==?=Any (?)
2452==X=None/Positive (X)
2454==C=Comparative (C)
2456==S=Superlative (S)
2500=1=B=Adverb (B)
2510===Degree
2521==?=Any (?)
2520==X=None/Positive (X)
2530==C=Comparative (C)
2540==S=Superlative (S)
2600=1=C=Conjunction (C)
2610===Class
2621==??=Any (??)
2630
2622==C=Coordinating (C)
2662,2663,2664,2666,2668,2670,2672,2674,2676
2623==S=Subordinating (S)
2632,2633,2634,2636,2638,2640,2642,2644
2630===Subclass
2631==?=Any (?)
2632==B=Continuative (B)
2633==C=Correlative (C)
2634==D=Disjunctive (D)
2636==G=Interrogative (G)
2638==K=Copulative (K)
2640==Q=Inferential (Q)
2642==X=Explanatory (X)
2644==V=Adversative (V)
;
2662==C=Causal (C)
2663==E=Conditional (E)
2664==G=Interrogative (G)
2666==L=Locative (L)
2668==N=Nominal (N)
2670==P=Comparative (P)
2672==R=Result (R)
2674==S=Concessive (S)
2676==T=Temporal (T)
; There are extensions to the C/Conj. Codes in CCAT, for "plus ...", etc.
2700=1=I=Interjection (I)
2800=1=T=Particle (T)
2810===Class
2821==?=Any (?)
2822==A=Alternating (A)
2823==F=Indefinite (F)
2824==G=Interrogative (G)
2826==M=Emphatic (M)
2828==N=Negative (N)
2830==P=Comparative (P)
2832==S=Subjective (S)
2834==U=Modal/Uncertainty (U)
2900=1=P=Preposition (P)
2910===Case of Object
2921==?=Any (?)
2922==G=Genitive (G)
2923==D=Dative (D)
2924==A=Accusative (A)
3000=1=M=Improper Prep. (M)
3010===Case of Object
3021==?=Any (?)
3022==G=Genitive (G)
3023==D=Dative (D)
3024==A=Accusative (A)
3100=1=O=Pronoun (O)
3110===Class
3121==??=Any (??)
3150
3122==A=Adjectival (A)
3160,3162,3164
3123==P=Personal (P)
3154,3156,3158
3124==RR=Relative (RR)
3150
3126==IF=Indefinite (IF)
3150
3127==IR=Indefinite Relative (IR)
3150
3128==CR=Correlative (CR)
3150
3130==DE=Demonstrative (DE)
3150
3132==S=Possessive (S)
3152,3154,3156,3158,3164
3134==X=Reflexive (X)
3152,3154,3156,3158
3136==RC=Reciprocal (RC)
3150
3138==IG=Interrogative (IG)
3150
3150===Subclass
3151==?=Any (?)
3152==I=Intensive (I)
3154==D=Demonstrative (D)
3156==G=Interrogative (G)
3158==F=Indefinite (F)
3160==1=First Person (1)
3162==2=Second Person (2)
3164==3=Third Person (3)
3200===Gender
3211==?=Any (?)
3212==M=Masculine (M)
3213==F=Feminine (F)
3214==N=Neuter (N)
3240===Number
3241==?=Any (?)
3242==S=Singular (S)
3244==P=Plural (P)
3270===Case
3271==?=Any (?)
3272==N=Nominative (N)
3274==G=Genitive (G)
3276==D=Dative (D)
3278==A=Accusative (A)
3280==V=Vocative (V)